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Abstract 

We study perturbations of the Erdos-Renyi model for which the 
statistical weight of a graph depends on the abundance of certain ge- 
ometrical patterns. Using the formal correspondance with an exactly 
solvable effective model, we show the existence of a percolation tran- 
sition in the thermodynamical limit and derive perturbatively the ex- 
pression of the threshold. The free energy and the moments of the 
degree distribution are also computed perturbatively in that limit and 
the percolation criterion is compared with the Molloy-Reed criterion. 

1 Introduction 

Random graphs were introduced more than forty years ago by mathemati- 
cians and have proved since then to be a very useful and versatile concept. 
The most studied example is the Erdos-Renyi model [H], vi^here the edges are 
independent. Balanced vv^ith the simplicity of its definition, the richness and 
deepness of mathematical results are really fascinating. 

On the other hand, it is clear that the Erdos-Renyi model is a poor 
idealization of real netv^orks, those v^^hich pop out naturally in sociology, 
biology, communication sciences,... For instance, the degree distribution (i.e. 
the statistics of the number of edges incident at a vertex) of most of the 
real life examples exhibits statistical, scale-free, properties very far from the 
poissonian behavior predicted by the Erdos-Renyi model 

^ Email: coulomb@spht.saclay.cea.fr 
^Email: bauer@spht.saclay.cea.fr 

^Laboratoire de la Direction des Sciences de la Matiere du Commisariat a I'Energie 
Atomique, URA2306 du CNRS 

1 



Many random graph models are now on the market, some consructed ad 
hoc to reproduce certain desired features needed to fit real data, some con- 
structed according to general principles. Belonging to the second category, 
the Molloy-Reed model [H] concentrates, inside the space of all labeled graphs 
with uniform probability, on the subspace of graphs with an arbitrarily given 
degree distribution. Within this model, many relevant quantities can be com- 
puted analytically, and there is a general percolation criterion given in terms 
of cumulants of the edge degree distribution. 

Our aim is to study another family of random graphs for which explicit 
computations are also possible. The idea is roughly as follows. Suppose 
that to each graph G one assigns a weight u{G). From the weight u one 
can construct another weight w{G) = Y1g'cg''^(^) where the sum is over 
graphs C with the same vertex set as G and edge set included in that of 
G. Conversely, from any weight function w one can extract a unique weight 
function u, but the expression of u in terms of w involves minus signs. 

We shall introduce a model for which the weight m is a counting function 
for certain structures on graphs. This weight u has two further properties : 
first it is permutation invariant, i.e. the weight of a graph does not depend on 
the labelling of its vertices, and second it factors over connected components, 
i.e the weight for a graph with several connected components is the prod- 
uct of the weight of each component. Note that by standard combinatorial 
arguments, these three properties are inherited by the weight w. 

Then we study the thermodynamic finite connectivity limit, when the size 
of the system (i.e. the number of vertices of the graph) becomes large but 
the average number of neighbors of any given vertex has a fixed finite value. 
In this regime, the relevant features of the weights u and w are encoded in 
tree generating functions u and ro respectively. 

The idea is that because u counts less objects than w (which is not true 
for arbitrary w because then u does not have a simple combinatorial in- 
terpretation in general), the generating function u has better convergence 
properties than the corresponding generating function It). We shall make the 
(crucial) assumption that the first singularity in the generating function of 
w can be obtained from the functional relation that ties it to the generat- 
ing function of u, without having to know the singularities of the generating 
function for u itself. This is certainly true, as we shall recall later, for the 
Erdos-Renyi model. It is also true order by order in perturbation theory 
around the Erdos-Renyi model for the models we introduce. We shall have 
little to say analytically on non perturbative properties, but the numerical 
simulations are encouraging. 

Under this assumption we are able to give expressions for the free en- 
ergy, the size distribution of connected components and for the percolation 
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criterion and size of the giant component when it exists. The expressions are 
not very explicit, because they involve the function u, which is very compli- 
cated in terms of the original parameters of the model. So on the one hand 
we show how to perform explicit perturbative computations of the physical 
quantities and on the other hand we introduce an effective model for which 
the relationship between u and u is directly computable, 

Our motivations are the following. First the models we study form a natu- 
ral and reasonnably manageable family of random graph models. Our point is 
to emphasize the connection with quantum field theory. We do not claim that 
the relation is very deep, but many random graph phenomena have quantum 
field theory counterparts, and quantum field theory gives a very convenient 
language and insight. Second, one of the interests of studying models with 
non trivial degree correlations is that attacks (see e.g. [Z|,[H1) automatically 
induce such features, even if they were absent to begin with. Third, at a 
more basic level, we can contrast with the Molloy-Reed model. This is use- 
ful for the purpose of general comparison, but especially because heuristic 
arguments, always based on non explicit assumptions, allow to recover the 
Molloy-Reed percolation criterion whitout using the particular hypothesis of 
the Molloy-Reed model, thereby suggesting that the Molloy-Reed percolation 
criterion has a much wider range of validity. This is probably wrong, and 
the model solved in this paper is definitely not in this range. 

2 The model 

After recalling the elementary graph theoretic definitions, we present our 
basic assumptions. We use the framework of statistical mechanics, i.e. we 
assign to each labelled graph of size N a weight (real positive number), 
which we use as an unnormalized probability distribution. For the Erdos- 
Renyi model, the weight is simply p^^^\l —p) 2 where E(G) is the 

number of edges of G. We shall choose a weight function that depends on 
more detailed local features of the graph, namely the abundance of certain 
geometric motives. 

2.1 A few definitions and notations 

Simple unoriented graphs, connected graphs, trees. A (simple un- 
oriented) graph G is a couple (V, E) where V 7^ is the vertex set and 
E C & V,i j} is the edge set. li V = {1,---,A^} for some 

integer iV, then G is called a labelled graph. The set of labelled graphs of 
size is denoted Qj^. 
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If G is a graph, we denote by V{G) the vertex set of G or the cardinal of 
this vertex set, depending on the context, i.e whether a set or a number is 
expected at that place ^. Similarly, E{G) will denote either the edge set of 
G or the cardinal of this edge set. 

A connected component of G is a minimal graph {V, E') with V <ZV such 
that if (i, j) e X 1/ and e E then j e V and {i,j} G E'. 

A connected graph is a graph which has only one connected component. 
A circuit of G of size s is a sequence (ii, ■ ■ ■ , is) of distinct vertices with s > 3 
such that {ii, i2}, ■ ■ ■ , {is-i- is} and {is, are edges. 

A tree is a connected graph without circuits, and the set of labelled trees of 
size is denoted by T^. 

If {i,j} is an edge of G, we say that i and j & V are neighbours in G. The 
number of neighbours of a given vertex i e in a graph G, also called the 
degree of G at vertex i, is denoted li{G), or k when there is no ambiguity. It 
is the number of elements of E in which i appears. 

Adjacency matrix of a graph, operations on matrices. The adja- 
cency matrix A{G) (or simply ^4) of a labelled graph G G is the N by 
matrix defined by A^j = 1 if {i,j} is an edge of G, Aij = else. Note 
that the set of adjacency matrices is the set of symmetric 0, 1 matrices with 
vanishing diagonal elements. 

The sum of all elements of any matrix M will be written ||M|| = Mij. 
If M is the adjacency matrix A{G) of a graph G, then it is clear that |||M|| 
is E{G), the number of edges of G. 

The sum of all diagonal elements of a square matrix M is the trace of M, 
written Tr{M). Note that \Tr{A{Gf) is again equal to E{G). 

2.2 Probability distribution, partition function 

To emphasize the similarities between the random graph model studied in this 
paper and quantum field theory, we split the weight of graphs in a product of 
a free part and an interacting part. The free weight is wq{G) = q^^'^^ where 
q e]0, -|-oo[. For later convenience, we also introduce p — q/{i + q) e]0, 1[. 
The interacting part is wi{G) = e^'^^\ where 

k k 

^This should cause no confusion, though from a fundamental point of view, a number 
is a set as well. 
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The full weight is w{G) = Wo{G)wj{G). The normalization factor included 
in the definition of the partition function 



GeGN 

is chosen in such a way that Zjv can be expressed as an average over Erdos- 
Renyi weights : 

V^^ - /pSi{G)\ 

where ER{p) assigns probability p [1 — p) 2 ^ to any graph on N ver- 
tices with E edges. We view wq as describing a gas of independent edges (the 
Erdos-Renyi model), and Si as describing the interactions between edges, the 
tfc's and Sfc's being arbitrary real parameters that regulate the abundance of 
certain local geometric features of G. Note that if G is made of several con- 
nected components, Gi, • • • , Gi, Si{G) — Si{Gi) -|- • • • -|- Si{Gi), so that wj 
factors as a product over connected components. This is also true of wq. This 
multiplicativity of the weight plays a crucial role to simplify the analysis be- 
low. Another striking feature is that w{G) is invariant under permutations 
of the vertex set. Many other interactions with this property could be in- 
corporated, for instance by including products of traces and norms into the 
interaction, however these would break the multiplicativity property. The 
"simple" multiplicative model does not seem to be exactly solvable, and we 
have to rely on perturbation theory to make explicit computations. Even the 
"trival" case when all parameters except, say, S3 vanish is already complicated 
enough. This is why we insist on keeping the two properties : multiplicativity 
and permutation invariance. 

The above model is perfectly well defined for all parameter values as long 
as N is finite. However, we shall be interested in taking a large N limit such 
that the average degree is a fixed number (so that the number of edges is 
proportional to N). This will impose some constraints, see below. 

3 Two combinatorial formulae for the partition 
function 

In this section, we shall derive two formulae related to exponentiation in 
combinatorics. 
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3.1 Connected components 

Suppose we consider the grand canonical partition function 

„ w{G) ^y^a^ 



as a power series in z : this is a formal sum over all graphs of any size, from 
which Zn can be recovered as 

Af(iv-i) r dz 

Z^^{l-p)^N\j-^E{z) 

where for the time being, the symbol § -^rrr is not viewed as a real contour 
integral, but simply as the operation of taking the term of order in the 
z expansion of a formal power series. If G has I connected components 
Gi, ■ • ■ ,Gi, we observe that 



wiG) 

-z 



V(G)! (eUV(GJ)!^ 



Summing over G is the same as summing over the connected components. 
Up to now the vertices of the G^ were labelled as subgraphs of G. Thanks to 
permutation invariance, one can instead sum over abstract finite sequences 
of / = 1, 2, ■ • • labelled connected graphs but weight the terms in the sum by 
a combinatorial factor 

i (EUm))! 

I'- n^m)! 

to take into account all possibilities to order them and to label the union 
of their vertex sets from 1 to X]ifc^(^fe)- Then S = where W — 
v'(G)! '^^^'^^ = i^ defined exactly as S except that the sum is 

only over connected graphs (this is what is meant by the symbol XIg)- '^^^ 
main formula of this section, the first exponential formula, relates Zj^ and 
W{z) as 

ZN=il-p)^N\j>-^^e^(^\ (1) 

3.2 Reorganization of the perturbative expansion 

The case when = = for all k corresponds to the Erdos-Renyi model. 
The probability of a graph only depends on the number of its edges, and 
many quantities such as degree distributions, component distributions and 
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percolation threshold take a simple form. When one or more of the t^s and 
Sfc's are not vanishing, the finer structure of the graph becomes relevant, and 
this will be the case of interest in this paper. 

Let us fix G and start from the expansion of e^''^'^^ in powers of t^s and 
Sfc's. This gives a linear combination of terms of the form: 



JJ ((rrA(G') \\A{G)X') ■ 



If we expand each matrix product, such a term becomes a sum of products 
of matrix elements oi A{G) of generic form A{G)i-^^j-^^A{G)i^j^ ■ ■ ■A{G)i^j^ and 
we may assume that ii ^ ji,-''iin 3n because otherwise the product 
is for any (simple graph) adjacency matrix. On the other hand, to any 
sequence iiji ■ ■ ■ injn with ii ^ ji, 7^ j„, we may associate a graph 

H with vertex set [1, A^] and edge set {{ii, Ji}, • ' ' > {^n,in}}- The product 
^(^)iiii^(^)i2i2 ■ ■ ■^(^)inin vaulshcs unless all edges of H are edges of G, 

in which case it has value 1. 

With this observation in mind, we define e^i^^) by keeping, in the ex- 
pansion of e^'^'^\ only those terms A{G)ij^jj^A{G)i2j2 ■ ■ ■ A{G)i^j„ such that 
{{ii,ji}, ■ ■ ■ , {in,jn}} exhausts the edge set of G (maybe with repetitions). 
Then by definition, 

H,E{H)CE{G) 

where the sum is over all graphs on the same vertex set as G w hose ed ge 
set is a subset of that of G. The reciprocal formula is given by e'^^(^) = 
J2G,E{G)cE(H)i~)^^^^~^^'^^^^'^'^^ the multiplicative property of e'^^^'^^ en- 
sures that e'^^(^) is also multiplicative : if H has / connected components 
Hi,---,Hi, 



gS/(if) ^ (>Si{Hi) . ..(>Si{Hi)_ 



Now 



GeQN H,E{H)(ZE{G) 

^ J2 H (i-p)'^-^(^)p^(«)i^. 

HeQN G,E{G)DE{H) 

For fixed H, T.G,E{G)DE{H)i^ - p)^^^-E{G)pE(G) ^ pE{H) ^ ^'^^^ 

HeGN 
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Defining 



u{H) = p^We^^(^) U = E^^-^(^) - E„ 

we can repeat the steps leading from multiplicativity and permutation in- 
variance to eq.(^ to obtain the second exponential formula 

^--'Vl/^^e"'). (2) 

3.3 Consequences of the exponential formulae 

The two expressions obtained for the partition function, one in terms of W{z) 
and the other in terms of U{z), show that for every nonnegative integer N 



It is convenient to eliminate N by the following trick : putting 1—p = e~ 



p) 2 takes the form of a gaussian integral I-^ ^ P 2 



From the change of variable x = ze^ 2 , it follows that 

r. +00 2 |3 



e^^^> = -== / e-^-^^^^^^ '>dy. (3) 

y/2TTfj J_oo 

There is no good reason why e'"^^^^^^ should be integrable in y along 
the full real axis. However, if one expands this function in powers of z, 
term by term integration is ok, and for the time being, eq.Q is a shorthand 
notation for the fact that this term by term integration leads to the formal 
power series of e'^^^^ 



4 Practical perturbative expansion 

Our aim is to organize the perturbative expansion to make explicit compu- 
tations. We would like to make a systematic enumeration of the terms that 
appear in perturbation theory. 

A typical term in the perturbative expansion is of the form Ai-^j-^ ■ ■ ■ Ai^j^ 
to which we associate the sequence iiji ■ ■ ■ injn i-e. a word written using 
the alphabet [1, A^]- For a graph G with adjacency matrix Aij, the product 
^nji^i2i2 ■ ■ ■ ^inju is 1 if {^1, ji}, ■ ■ ■ , {in,jn} are amongst the edges of G and 
else. If / is the number of distinct edges among these n 2-sets, the average 
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is simply the sum of the Erdos-Renyi weights of all graphs containing these 
/ edges. This is known to yield {Ai^j^Ai^j^ ■ ■ ■ Ai^j^^) ^^^^ = This average 
is invariant under permutations of [1,A^], all vertices play the same role in 
TrA{G)'' and ||y4(G')''|| . So we regroup the words iiji ■ ■ ■ injn in classes under 
the action of the permutation group, compute the size of each class and find 
a representative in each class. Then we enumerate the representatives and 
take multiplicities into account. 

The idea is the following : suppose that you have a finite word written 
using any alphabet (i.e. any set of symbols) on, say, N letters. To each letter 
that appears in the word, associate an integer as follows : assign 1 to the 
first letter of the word, then assign 2 to the next new (i.e distinct from the 
first) letter appearing in the word, then 3 to the next new (i.e distinct from 
the first and the second) and so on until all letters appearing in the word 
have been assigned a number, the highest one being, say, v {v is the number 
of distinct letters used to compose the word, which may well be strictly 
smaller than the length of the word, because the same letter can appear 
more than once). Replacing each letter of the word by its number leads to a 
new word, the alphabet being [l,v] C [1,N] this time. The words obtained 
by this procedure are characterized by the fact that 1 appears before 2 which 
appears before 3 and so on. Say that two words in the original alphabet are 
equivalent if they yield the same numerical word by the above procedure. 
Then each class contains (^j^^y^ words. 

In our case, the original alphabet is already [1, A^], and we are led to the 
concept of normalized sequences, an elaboration of a procedure introduced 
in a slightly simpler context in |3|. 

4.1 Normalized sequences 

For an arbitrary sequence iijii2j2 ■ ■ ■ injn (with alphabet [1,A^]) such that 
ii 7^ Ji) ■ ■ ■ ; "^n 7^ jn, wc define V = ji, ■ ■ ■ , in,jn}, the number of distinct 
vertices in the sequence, and / = Ji}, {^2, 72}, ■ ■ ■ ? {^n? in}}, the number 

of distinct edges in the sequence. 

We shall say that a sequence ii jii2j2 ■ ■ ■ injn is normalized with respect 
to Ylk {TrA{G)'') * Ylk ||^(G')'^||"'= or more simply with respect to (mk,nk) 
if 

• n = kC^^k + rik). 

• In this sequence, 1 comes before 2, which comes before 3,... which 
comes before v. 

• il^ jl, ■■■,in7^ jn- 
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• The sequence has a correct structure as regards Tr and || ||. That 
is, to TrA{Gy {ruk = Sp^k,'nk = 0) and ||A(G')^|| {nik = 0,^^ = Sp^k) 
correspond the constraints ji = 1,2, ■ • ■ ,jp-i — ip, with the additional 
constraint jp = ii for TrA{Gy\ If more than one term of {mk,nk) 
is nonzero, then we choose an arbitrary ordering : increasing /c's, all 
traces coming before norms. This allows to decompose the sequence in 
subsequences, which correspond either to a trace or a norm, and are 
accordingly constrained. For instance, when rris — 1 and ni = 1, ^3 = 1 
are the only nonvanishing elements of {mk,nk), the sequence has a 
correct structure if it is of the form iijii2j2hj3 HJi 'i-dsieje'^Tjr where 
ji = i2,j2 = isJs = h and js = ieJe = ij (to ||A(G')|| correspond no 
constraint of structure). 

We write M.v,i,(mk,nk) for the number of normalized sequences with v ver- 
tices and / edges. By our previous remarks, the class containing a normalized 
sequence has members, each of which leads to the same average. Hence 

(n {{TrA{Gfr \\A{Gf\r)) = 7j^,P'M.Mrn.,nky (4) 

\ / ER(j>) v,l ^ ' 

In doing explicit computations, which can be painful, there is a useful 
check of the formula, namely a sum rule corresponding to p = 1, in which 
case only the complete graph contributes, and there is no average to compute. 
It is staightforward to check that if G is the complete graph on N vertices, 

TrA{Gf ^{N - if + {-f{N - 1) ||^(G')'=|| = N{N - if. 

Hence 

v,l ^ ^' k 

4.2 Graphical expansion 

Although the interpretation in terms of normalized sequences is adequate 
for the purpose of numerical computations, there is another useful graphical 
representation of the perturbation series which we present briefly now. 
Expanding TrA'' — Ai^^i^A^^^i^ ■ • ■ A^.n we represent each term as a 

colouring of a labelled cycle on k vertices with colours, vertex j carry- 
ing color ij for j = 1, • • ■ , A". In the same way, we represent each term in 
M^ll = ^ii,i2^i2,i3 ■ ■ ■^ik,ik+i ^ colouriug of a labelled segment 
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on k+ 1 vertices with N colours, vertex j carrying color ij for j = 1, - ■ ■ ,k+l. 
The expansion of e^' in powers of t^s and s^'s, is then represented as a sum 
over colorings, with N colours, of labelled graphs whose connected compo- 
nents are cycles and segments. Pick one term, call it T, in this sum. Each 
cycle of length k yields a factor tk/ (2/c), each segment on A; + 1 vertices yields 
a factor Sfc/2, there is a factor 1/mfc! if there are cycles of length k, and 
a factor l/rife! if there are Uk segments on A; + 1 vertices. 
The probabilistic average of F (over the set of incidence matrices) is zero if 
some edge of F has it's two extremities of the same colour. If not, let e(F) 
be the number of distinct pairs of colors that appear as extremities of edges 
of F. The probabilistic average over the set of incidence matrices multiplies 
the former weight of F by p'^^^'' . 

Let v{r) be the number of distinct colours in the colouring of F. Say 
that terms F and F' are equivalent if there is a permutation of [1, A^] (the 
set of colours) that maps F to F'. The equivalence class of F is made of 
N\/{N — v{T))\ graphs with the same weight. The equivalence class of F has 
a graphical representation : starting from F, draw a dashed line between two 
vertices if and only if they carry the same color. Then remove the colors. 
In this way, obtain a graph with two kind of edges, solid and dashed. The 
graphs that appear in this operation have two properties. First, the solid 
components are cycles and segments, and the dashed components are com- 
plete graphs. Second, two vertices cannot be adjacent for solid and dashed 
edges at the same time. In lack of a better denomination, we call graphs 
satisfying these two conditions (labelled) admissible graphs. The notion of 
connectivity for admissible graphs treats solid and dashed edges on the same 
footing. 

Instead of working with labelled admissible graphs, we may use unlabelled 
admissible graphs. Then the combinatorial factors {2k for a /c-cycle, 2 for a 
k + 1-segment, a factorial for permutations of components of the same type 
and size) which take into account only solid edges, are replaced by the order 
of the symmetry group of the admissible graph, the group of permutations 
of vertices that preserve solid and dashed edges. 

Let H be an admissible graph. Two vertices being declared equivalent if 
they are connected by a dashed line, let v{H) be the number of equivalence 
classes of vertices. Two edges being declared equivalent if their extremities 
are equivalent as vertices, let e{H) be the number of equivalence classes of 
edges. Furthermore, we denote by s{H) the order of the symmetry group of 
the graph. Then 
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Figure 1: Graphical enumeration/interpretation of the contribution of TrA^ 
to the partition function. On the left are drawn all possible identifications, 
schematized by complete dashed graphs. On the right identification has been 
carried out. The numbers of symmetries, vertices and edges are given for each 
graph. 



where the sum is over unlabelled admissible graphs H, mk{H) is the number 
of solid /c-cycles and nk{H) the number of solid segments on A; + 1 vertices. 
See figure n] for the example of TrA^. 

4.3 Exponentiation 

Computations on a sheet of paper are more economical using admissible 
graphs, but systematic machine enumeration is best carried out using nor- 
malized sequences. 

If is not difficult to convince oneself that the notion of connectedness of nor- 
malized sequences or of admissible graphs is the same, and coincides with 
the notion of connectedness used to establish eq.©. We infer that 



where is the sum over unlabelled connected admissible graphs or equiv- 
alently that 



'k ' 



U{z) = Z + 



v,l,{m, ,n, } 
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where f^'s^' = Ylk (||)™'° (^)"'° ^^^^ -^li {mk n^) number of normal- 

ized connected sequences. 

5 Finite connectivity large N behaviour 

Our aim is to use the identities eqs. ()H2|) and their consequence eq.Q to 
derive mean field type identities valid in the limit iV — > oo, pN,tk, Sk being 
independant, or more generally having finite limits for large A^. It is 
customary to define a = limN^oopN. 

5.1 Caveat 

In this regime, when tk = Sk = for all fc's (the Erdos-Renyi model), the 
event that a graph contains a component with much more edges than ver- 
tices has a vanishingly small probability, and the connected components look 
locally like trees. This is called the dilute regime. In particular, the complete 
graph (a caricature of a non dilute graph) has a negligible weight in the dilute 
regime. 

We are looking for an analogous regime for the perturbed Erdos-Renyi 
model. However, in that case, the complete graph has weight 

w = g^(^-i)/2eEfc^((^-i)"+(-)"(^-i))+Efc^^(^-i)\ 

Compare this to a union of isolated vertices (a caricature of a dilute graph), 
which has weight w = 1. Consider for instance, the case when there is 
only one nonvanishing perturbation parameter, say S3. The two weights are 
equal if qe^^^^~^^ = 1. That this crude balance gives the correct qualitative 
frontier between a dilute regime and a dense regime is confirmed by numerical 
simulations. 

If p = a/N and S3 < 0, the complete graph is indeed strongly suppressed 
(in fact much more drastically than for the pure Erdos-Renyi model). How- 
ever, if S3 > 0, the weight of the complete graph submerges the weight of 
dilute configurations. 

To summarize, the following discussion makes sense only if 5*/ does not 
become positive and large N'^) for non dilute configurations. An easy 
way to ensure that is to take the sign of all perturbations negative. Another 
possibility would be to take only a finite number of nonzero perturbations, 
and then impose that the dominant one be negative. 
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5.2 Main equations 

With these observations in mind, we start from 

and 

Recall that p = + g) = 1 — e~^. We fix p = pjv = a/N where a is 
a constant, and make changes of variables y y/p, x ^ x/q in the above 
integrals. In the expansions 



and 

the sum is over connected graphs, and by Euler formula, E — V — L — 1 
where L > is the number of loops. Hence we may write formaly 

TV 

Uiyip) = —u{y) + o(N) W(x/q) = —tv(x) + o(N), 
a a 

where u{y) = '^-^'^'^'^'^ v^ry. = ^^'^'^^ v^{t)\ sums over trees 

(connected graphs with L = 0) . If we use a naive version of the saddle point 
approximation and write Zjv = e^^~^°^^\ we find 

F = _i_iog^ + lu(y*) (5) 

a a 

F = _^-i-log^ + -ro(x*) (6) 

Mx) = -— + u{xe'), (7) 

where the x*,y* and z are appropriate saddle point values : 

a = x*tt)'{x*) = y*u'{y*) z = xe^u'{xe^). 

We end this section with the following remarks. The average number of 
edges is more or less the variable conjugate to p. More precisely, the average 
number of edges is 
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We infer that in the thermodynamic regime with N oo and = a/N, 
the average number of neighbors of a given point (i.e. 2/N times the average 
number of edges) is 

dF 

c = a + 2a— (8) 
da 

For the pure Erdos-Renyi model, the weights form a probability distribution, 
Zp^ = 1, F = 0, and c = a. In the perturbed models, a is not so easily 
measured on the graph, and only the parameter c has direct physical meaning. 
From the point of view of quantum field theory, it is natural to view a as 
the bare connectivity and c as the physical connectivity. For each coupling 
constant tk or Sk, it would be desirable to find analogous physical quantities 
that first, one can compute directly on a random graph without knowing a 
priori the sampling measure and that second one can reduce tk or Sk to first 
order in perturbation theory. This is very ambiguous and we have not found 
an elegant way to select such physical observables systematically. 



5.3 Discussion 

We have seen before that a dilute regime for the perturbed Erdos-Renyi 
model with fixed values of the t^s and s^'s cannot exist if Sj becomes large 
positive for graphs with many loops. Here we discuss a related limitation 
even if one considers only loopless graphs. 

Instead of considering the complete graph, look at the star shaped tree on 
n vertices, whose adjacency matrix we denote by S, with a center connected 
to the n — 1 other vertices. From TrS = 0, \\S\\ = TrS^ = 2{n — 1), 
IIS"^!! = n{n — 1) and S'^ = {n — 1)S, it is easy to compute recursively that 
7^^^2fc+i _ Q 115^^+1 II = 2{n-lf+^ for > 0, and that Tr^^fe _ 2{n-lY 
and II 5'^'^ II = n{n — 1)^ for A; > 1. As an example, consider again the case 
when there is only one nonvanishing perturbation parameter, say S3. The 
contribution of star shaped trees to ro is Ylm (n-i)! ^"^^ trees give 
a positive contribution to w, no compensation is possible and we conclude 
that if S3 > 0, the series for w has a vanishing radius of convergence. So it 
is meaningless to deform contours, and eq.® is meaningless as well. Then 
so is eq.lini because analyticity of \x{y) at small y implies analyticity of Xo{x) 
at small x via eq.((Zl). On the other hand, if S3 < 0, the star-shaped trees of 
large size are very strongly suppressed. Let us note however as shown in the 
next section that, in the realm of formal power series, eq.© describes the 
correct combinatorial relationship between u and ro even if both series have 
a vanishing radius of convergence. 

More generally, if Si{T) /V{T) is bounded above (an easy way to ensure 
that is to take the sign of all perturbations negative, another possibility would 
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be to take only a finite number of nonzero perturbations, and then impose 
that the dominant one be negative) , tt) is analytic near the origin. Indeed, if 
Si(T)/V(T) < T for all trees, using the fact that there are n"~^ labelled trees 

n — 2 

on n vertices, we see that < ^ < j— e™, leading to a nonzero radius of 
convergence. 

For instance, when the sign of every perturbation is negative, the radius 
of convergence is a nonincreasing function of the t^s and s^'s : it gets larger 
and larger as the t^s and s^'s get more negative. To see that it remains finite, 
consider the linear graph on n vertices, whose adjacency matrix we denote 
by L. For this graph, for fixed k and large n, TrL^ and \\L^\\ grow at most 
linearly with n : they count k steps walks, and if the starting point is given, at 
each step there are at most two choices, so there is the obvious upper bound 
n2''. There are n!/2 ways to label the linear graph (the symmetry group is 
of order 2). So the contribution of the linear trees to ro decreases at most 
geometrically with the size. As all trees give a nonnegative contribution, ro 
has its first singularity on the real positive axis, and at a finite distance. 

In the situation when tn has a finite radius of convergence, we conclude 
that there is a forest-like regime for the perturbed Erdos-Renyi model that 
extends the forest-like regime of the pure Erdos-Renyi model, and that it 
is described by the equations eqs. (l5l6f7jl . at least in the small a phase. We 
shall elaborate on this point in the sequel. 

5.4 Combinatorial remarks 

As we have seen before, the above formulae for the free energy rely on crucial 
assumptions. What we would like to show in this subsection, before embark- 
ing on a detailed discussion of analytic features of these equations, is that the 
combinatorics embodied in eq.Q is correct. Suppose that we forget about 
the random graph model for a moment, and consider instead a random forest 
model, w being the generating function for random weighted trees. 

Expand the function u{xe^) in powers of z : u{xe^) = ^ (^^) ^i^)- 
It is well-known from quantum field theory that the formal expansion of 




is a weighted sum of all connected Feynmann graphs. The weight of a Feyn- 
mann graph is computed as follows : each edge gives a factor h (propagator) , 
each vertex of degree / gives a factor (a;^)'u(a;) and finally one divides 
by the order of the symmetry group of the graph. The logarithm is given 
by the same sum, but restricted to connected graphs. For connected graphs, 
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the power of h is the number of loops minus 1, so the dominant contribution 
in the small h limit restricts the sum to connected loopless graphs, i.e. trees. 
On the other hand, the small h limit is given by the saddle point approxima- 
tion, i.e. eq.©. So ro is a sum over all trees, each vertex of degree / giving 
a factor (x^) u(x). 

But u(a;) itself is a tree generating function, so (x^)'u(x) is the gener- 
ating function for trees with I marked vertices (a vertex can be marked more 
than once). So eq.© means that to construct It), one takes arbitrary trees, 
(call them naked trees) and then blows up every vertex of degree / into a 
new tree with / marked vertices from which naked edges emerge. Note that 
a naked vertex can be blown up in a trivial tree, corresponding to the term 
X in u{x) = X + ■ ■ ■. 

As we have emphasized before, if T is a tree, each term t in the expansion 
of e'^^*^^-' in terms of matrix elements of the adjacency matrix A of T defines a 
subgraph of T i.e. a forest with the same vertex set as T, edge {i,j} G E(T) 
being present in the forest if and only if the term t contains the factor Aij or 
Aji. But the connected components of the forest being given, one reconstructs 
tt) by connecting the different components with appropriate edges. This is 
exactly the procedure described by eq.(|Zj) if ro is the generating function for 
g5/(r) ^j^^ ^ Qj^g e^'^'^h 

5.5 Effective model 

If we have the original model in mind, each is itself a highly nontrivial 
kind of partition function. However, if we take each as an independent 
parameter, we can make a rather general analysis. In fact, there is a simple 
model for which the u^'s are the fundamental microscopic parameters in the 
sense that they appear directly in the definition of the weights. We call this 
model an effective model for the following reasons. 

In quantum field theory, the term "effective" often means that one re- 
nounces to deal with all observables of a system and only concentrates on 
certain degrees of freedom, so that the other ones can be averaged. For in- 
stance, to compute the long distance behaviour, one first averages over the 
short distance fiuctuations. We are going to do something analogous here : 
we renounce to observe the local structure of connected components and are 
only interested in the distribution of their size. So instead of keeping track 
of the weight of each detailed connected component, we can as well give all 
components of a given size the same weight, namely the average weight given 
by the original model for components of that size. 

Now, to the precise definition. Choose parameters Ci,C2,- ■■ and define 
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an effective weiglit u^^^^\H) = p^^^^Ck for any connected graph H of size 
fc, and assume multiplicativity, so that for an arbitrary graph u'^^^f\H) = 
pE{H) Yli^c^''^^^ where nk{H) is the number of components of size k of H. 
If we trade Ck for X'^Ck, we multiply the weight u^^^^\H) by a trivial factor 
SO we shall assume that the c^'s are normalized by ci = 1 (the special 
case ci = would need a separate treatment). Define the corresponding 
effective weight w'-^^^^G) = q^^^^ J2h,e{h)ce{g)^''^"KH)p-^'-^\ where the 
sum is over all graphs on the same vertex set as G whose edge set is a 
subset of that of G. Note that contrary to the weight u'^^^^\ the weight 
uj^^ff^G) does in general depend on the detailed structure of the graph, 
and not only on the sizes of connected components. Our interest however 
is in the distribution of sizes of connected component of graphs of large 
size N ^ oo sampled using the weight w^^ff\ Following the same steps 
as for the original model, we find that this distribution can be obtained in 
the thermodynamic limit from tree generating functions u^*^-^-^-* and Vo'^^^^'^ 
satisfying the very same coupled equations eqs. (j5f6f7jl as the original u and 
tt). The coefficients of u'^'^-'^-'^^ are very simple in terms of Ci, C2, ■ ■ ■ because all 
components of the same size have the same weight, and by Caley's theorem 
there are trees on k vertices. Hence u'^^-^-^^^ = X]a,>i ^F'^fc?/'^- Hence if 

one sets Ck = J^TeTk ^^'^'^^ where the sum is over trees of size k, the 
effective model has the same component size distribution as the original one. 

For all these reasons, we shall remove in the sequel the superscript ^^■^■^^ 
from u^^'^'^) and n)^^-^-^\ even if we sometimes keep the distinction between the 
weights u and w and the effective weights u^'^^^'> and w^^^^\ Accordingly, we 
shall analyse eqs. (l5l6l7|) . which involve only the component size distribution, 
without making explicitly the distinction between the original model and the 
effective model. 



5.6 Connected components and percolation 

We return to the finite N arbitrary p case to start the argument. As Wk is, 
modulo an overall multiplicative factor, the total weight of connected graphs 
of size A;, we infer from eq.lQJ) that the mean number of connected components 
on k vertices is 

WkdZu 



iV, 



ZNdWk 



N\ Z^_k,^ iV(]V-l)-(iV-fc)(]V-fc-l) 



k\{N-k)\ " Zjv 

Taking into account that when pN = a, p{N — k) = a{l — k/N), we find 



that in the dilute regime, for fixed k and ^ 00, ^^—^ ~ e and 
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AT I AT ^k-l-k(a+F+a^) 

k'. 

As expected, in this regime only trees contribute thermodynamically to the 
finite components. 

From these equations for the abundance of connected component of each 
size, we can easily derive a percolation criterion. Indeed, by construction, 
Ylk^-^k/N = 1, but what about the approximate sum 



k\ 

k 

For each fixed k and iV — > cxo, the k^^ term is a good approximation to 
kNk/N, but there is problem of inversion of limits. Physically, the approxi- 
mate sum counts the fraction of points in components of finite size, so it is 
< 1. 

If we assume that u is analytic at small y, then F is analytic and small at 
small a and ro is analytic and small at small x. Moreover, x* is an increasing 
function of a at small a. From eq.(ini), we infer that F + = —a — 

log^ or equivalently, ae""-^-"!? = x*. Then fc^a'^-ie-K^+^+^lf ) = 
^w'{x*) = 1 for small enough a. However, it may happen that as a function 

of a, X* = ae~"~^~"^ is non monotonic. There may be a value Oc such 
that X* increases in the interval [0, aj but then starts to decrease, so that 
x*{a) < x*{ac) in some interval strictly containing [0,ac]- One could build 
models where x*{a) has several oscillations, but in the sequel, we concentrate 
on the first. For a given a, denote by a < the small solution to the 
equation x*{a) = x*{d). Then we obtain the more general result that finite 
components occupy a fraction —tv'{x*) = - < 1 of the sites in the system. 
If a > ttc, something else than finite components, in fact on general grounds 
one single giant component, occupies a fraction 1 — f vertices. Thus, the 
percolation criterion is that ae~°'~^~°'^ is maximum at a = ac- So the 
transition point is when 

dF d'^F 

The first two terms yield simply the true average connectivity c = a+2a^, it 
would be nice to have a direct physical interpretation of the third term a^^-. 
This percolation criterion is expressed solely in terms of the free energy as a 
function of a. But it can also be related to analytic properties of W. Indeed, 
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the relevant saddle point equation is a = x*w'{x*). As a approaches Oc, x* 
reaches a maximum, so that the x* derivative of x*xv'{x*) has to get large, 
diverging at a = Oc- If «c is finite, this means that xv and tn' are finite at 
a = ac, but ro" is infinite. If the coefficients of ro are non negative^, this 
means that x*{ac) is the radius of convergence of ro. From x*{ac), we recover 
ac itself by the general saddle point equation a = x*rv'{x*). 

Suppose now (we shall soon argue that this is true in many cases including 
interesting ones) that even if x*{ac) is the radius of convergence of in, the 
function u is not singular at y*{ac). Hence u allows to compute the free 
energy F and show that it is analytic in some interval strictly containing ac- 
From that point of view, we observe that the saddle point equations imply 
that X* = y*e~°' = ?/*e~^*" *^^*^ from which the percolation criterion, i.e. the 
determination of the maximum of x* becomes 

y*^ = y*n\y*)+y*'n"iy*) = l. (10) 

In the same spirit, the true average connectivity can be expressed as c = 
y*n'iy*) + 2-2^y 

In general, if u has nonegative coefficients and eq. lfTTIl) has a solution 
strictly within the disc of convergence, one can go through the above argu- 
ment in the reverse order to prove the existence of a percolation transition 
with the announced characteristics. This is the case for instance if u is an en- 
tire function with nonnegative coefficients, or more generally if u is function 
with nonnegative coefficients such that u" is unbounded when the argument 
approaches the radius of convergence. It is worth to observe that if the y 
expansion of u has nonnegative coefficients, then the same is true of the 
X expansion of xv. Indeed, from eq.Q and the corresponding saddle point 
equation we infer that xto'{x) = z. Hence as functions of x, ro and z have 
the same singular points, and 

tt,'(x) = e""''(")u'(xe""''(")). 

Expand both sides of this identity to see that wi = 1 and that tVk+i — u^+i 
is a polynomial in Ui = 1, U2, ■ ■ ■ , Ufc, roi = 1, rD2, ■ ■ ■ ,XVk with nonnegative 
coefficients. 

In the case of our original model, the situation is more tricky. We know by 
construction that the x expansion of to has nonnegative coefficients, but to 
ensure the existence of a dilute regime, the same cannot be true in general of 
u. In the sequel, we shall see that in perturbation theory at any finite order, 

^This should be the case in statistical mechanics, and it is true by construction for our 
initial model as long as the parameters are real. 
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we are in the following situation : the coefficients of u may be negative, but 
nevertheless u'{y) is analytic (in fact a polynomial) and positive in a interval 
strictly containing and a solution of eq. (fTn|l . Then our previous arguments 
can be made rigorous and there is a (perturbative) percolation transition 
with the announced characteristics. We do not know if this argument can 
be extended outside the realm of perturbation theory. The numerical sim- 
ulations are encouraging, but the behaviour of some perturbative series is 
puzzling. Before discussing that, let us consider three simple but significant 
examples. 

6 Three easy examples 

6.1 The case of the Erdos-Renyi model 

Let us recover the Erdos-Renyi model in this framework. In that case, by 
construction, U{y) = u{y) = y and y* = a. Eq.^ leads to F = for all val- 
ues of a (no surprise, for the Erdos-Renyi model the weights are normalized 
as a probability distribution). Then eq.© leads to z = xe^, and from the 
Lagrange inversion formula, 



which are the classical (rooted and non rooted) tree generating functions (in 
fact, this gives a proof of Caley's formula for the number of trees). Note that 
if we use naively eq.®, we can deduce that F = only for a < 1. 

The number of connected components of size /c is ~ iV^-^Q;^~^e~'^", 
which is well-known to be true for fixed k and large N, for any value of a. 
Notice again that the use of u plays a crucial role in our approach. Using only 
ro, we would get the component distribution only for a < 1. In fact, for the 
corresponding random forest model (which is thermodynamically equivalent 
to the random graph model for a < 1) limjy^ao Nn/N is ^-^a'^~^e~'^" for 
a < 1 but is nonanalytic at a = 1, which is the percolation transition. 

The total number of points belonging to components of size /c is ~ 

For a < 1, X]fc ^^a^'e"^" = a, but for a > 1, Y,k ^^a'^e"''" = a, where 
a is the smallest solution to ae""" = ae~". The giant component occupies 
~ A^(l — a/a) sites. 




k 



k 
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6.2 The nested Erdos-Renyi model 

As another example, suppose that u^''^^\H) = p^(^) for all graphs, i.e that 
c, = > 1. Then w(^ff\G) = q''^''^ J:h,e(hKE(G)^ = i^qf^''^. Both 
weights describe the Erdos-Renyi model, but with different values for the 
probability of an edge. Going to the large N finite connectivity limit, we find 
u = ^k>i n^y^ ^ ^'^'^ from our previous analysis of the Erdos-Renyi model, 
we find that yu'{y) — X]fc>i ^^~^y^ ^^e Lambert function L{y)^ the solution 
of L{y)e^^^y^ = y analytic close to and vanishing at 0. Hence z = L(xe^), 
so that ze^^ = xe^. Hence 2z = L(2x). Moreover, from u(y) = L(y) — ^'^'^ 

we find 2x\3^'^^^\x) = L{2x) — ^^^^^ . So we recover the doubling of the edge 
probability when passing from the u^'^f^'' weight to the w^^^^^K The u^^^^'' 
percolation transition is at a = 1 but the w^^^^^ percolation transition occurs 
at q; = 1/2. Note that the equation y*u'{y*) = L{y) — a cannot be solved for 
a > 1, but that the free energy F = a/2 and the true connectivity c = 2a 
have an analytic continuation for larger a's. That this analytic continuation 
is the true value of F cannot in principle be decided from our arguments 
(we would have to do one more step of the same construction to view the 
ui^ff) weight itself as a composite weight). But this does not prevent us 
from finding and analysing correctly the w^^^^^ transition, because it occurs 
strictly before the ^('^•^•^^ transition. 

6.3 The matching model 

When Mfc = for /c > 2 we recover the Erdos-Renyi model, so let us try the 
next degree of difficulty, when = for k > 3 but U2 is a free parameter. 
Thus w'^^^^\G) is the generating function for a gas of disjoint egdes on G, 
that is, the generating function for (all, non necessarily maximal) matchings 
on G. This is a rather natural weight from the point of view of combinatorics. 
It is plain that the detailed structure of G is relevant, and not simply the 
size of its connected components. On the other hand, the u/^^f^ weight is 
nonzero only for a finite number of connected graphs, so that the function 

[/(^•^•^) is simply U^^^^^ ^ z + qu2^ and Zn = N\§ ^e'+i''^'^ . In such a 
simple case, the saddle point approximation applies without subtleties, and 
we retrieve, in the large N finite connectivity limit, the expected equations. 
The function tv{x) does not seem to be an elementary function. The small 
X and the perturbative small U2 expansions are straightforward but become 

2 

quickly ugly. However from u = y + M2^, we can easily find the percolation 
criterion. Parametrizing U2 = (with yc g]0, 1] for positive 1*2) and using 



22 



eq. (fTn|) . one finds that at the percolation threshold 



2 2 1 + 2/c 

So ttc decreases from 1 to 1/2 when U2 grows, but the physical average con- 
nectivity Cperc increases from 1 to 3/2. The special case M2 = 1 is of special 
combinatorial significance, because the weight w^'^^^^G) counts the number 
of configurations of non adjacent edges on G. Then i/c = 1/2, Uc = 3/4 and 
Cperc = 13/12. Consequently, Xc = \e~^^^, from which we can derive a result 
of direct combinatorial significance : 



N 



^ J] # matcMngs of T ^ ^''^^^ 

to be compared with YIt^Tm ^ ~ '^n\ ^ ~ '^W/^- Hence, if we put the 
uniform probability law on labelled trees of size N, the average number of 
matchings on a random tree of size N behaves like C** (^)^^^- 

7 Back to the original model 

7.1 Finite orders in perturbation theory 

Remember that we established in section 14.31 that 

where Y^h '^^^^ unlabelled connected admissible graphs (we could 

equivalently reason in terms of normalized connected sequences). Consider 
the coefficient of 11^^^'''^^'' ■ ™™ ^^^^ admissible graphs with 

solid /c-cycles and Uk solid segments on A; + 1 vertices. There is only a finite 
number of ways to join these fixed solid components with any number of 
complete dashed graphs. So the coefficient of Y[k^T''^k'' ^ polynomial in p 
and z. A fortiori, if we restrict to admissible graphs H such that f = / + 1, 
which are the ones contributing to u, the sum is finite, and the coefficent of 
Ylk^T'^^k'^ in the perturbative expansion of u{y) is a polynomial in y. 

Note that u{y) = y + 0{y^), so yu'{y) = y + 0{y^) and yu' (y) + y'^u" (y) = 
y + 0{y'^), where the 0{y'^) vanish to zeroth order in perturbation theory. 
Hence to any finite order in perturbation theory, yu'{y) is analytic and in- 
creasing up in a large value of y, but yu'{y) + y'^u"{y) = 1, the signal of the 
percolation transition, occurs at a value of y of order 1. Hence generically 
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to any finite order in perturbation theory our initial model exhibits a perco- 
lation transition described by our previous results. In the following we shall 
make explicit perturbative computations of the free energy, the percolation 
threshold, etc, for the special case tk = 0,Sk = 2ij,5k,z- To compare with 
the prediction of the Molloy-Reed criterion, we need first to show how to 
compute it in perturbation theory for our model. 

7.2 Moments of the degree distribution, Molloy-Reed's 
criterion 

By degree distribution of a given labelled graph G on iV vertices is meant the 
sequence (no(G), ■ ■ ■ ,nN-i{G)) where nj(G) is the number of vertices 
in G with exactly i neighbours. For fixed A^, the Molloy-Reed model concen- 
trates on the set of all those labelled graphs with a fixed degree distribution 
(no, ni, ■ ■ ■) and gives them uniform probability, see [H]. This represents a mi- 
crocanonical point of view in the sense that the degree distribution is fixed 
and can not fiuctuate. For a grand canonical presentation of the same idea, 
see [2]. If, for large N, {no/N,ni/N, ■ ■ ■) converges (in a sense made precise 
by Molloy and Reed) to a probability distribution (/o, /i, ■ ■ ■), a limiting ran- 
dom graph model is obtained, which depends only on (/o, /i, ■ ■ ■) and not on 
the details of the approximating sequence {no/N,ni/N, ■ ■ ■). We now recall 
the percolation criterion for the Molloy-Reed model with arbitrary degree 
distribution. 

For a given graph G, define k^'^\G) as the following average over vertices of 
G : 

1 ^ 

1=1 

For instance, when q = 1, Nk{G) = 2E{G). 

The statistical average {k'^) = ^-^^y^^y if (G)A;*^''^(G) is called the g-th 
moment of the degree distribution. Note that in the Molloy-Reed model, all 
graphs have the same degree distribution, so that (/c^) = k^'^\G) for all G in 
the relevant statistical ensemble. 

The Molloy-Reed percolation criterion states that the Molloy-Reed random 
graph has a giant component if and only if the two first moments of the 
degree distribution verify (/c^ — 2k) > 0. For the Erdos-Renyi model, {k) = 
a, {k^) = a{a + 1), leading to the percolation threshold a = 1. 
Our present purpose is to compute in perturbation theory the first moments 
of the degree distribution for our model. In principle, it is possible to compute 
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(fc"^) for any g > 1. In the definition 

GeSjv i,ji,---,jq 

of the g-th moment, e'^^'^'^^ ^ • ... a^-^ ■ ■ ■ aij^ may be viewed as the x deriva- 
tive taken at x = of exp (^Sf'\G, x) j = exp (^Sj{G) + x ... aij^ 



Seen as a new term of interaction, this exponential is still multiplicative and 
permutation invariant. We thus follow the steps which led us to eq.Q (see 
sec. I3.2^1 to prove that {k'^) is the derivative taken at x = of i^^illl ^ -^^qUM(z) 



■jq-z^, where Z}^ is the partition function of the model obtained from the 



original model by replacing Sj by 5'}^''. In the large N limit, we proceed just 
as in sec. 15.21 to show that Z^-* = ^^Fq+oiN) ^ being the new free energy : 
Fq = —1 — log ^ + ^u'^'^^d/g). In this expression, u*^*^^ is the tree generating 
function for the new model and y* is the corresponding saddle point. 
We now take the derivative and put x = to yield 

T i,jl,-,jq ^ '' 

Just as we did in the original model, we can use normalized sequences 
(or admissible graphs) to give a combinatorial interpretation of the over- 
lined term. A sequence iiji ■ ■ ■ injn is said to be normalized with respect to 
{mk,nk],q if 

• n = q + Y.k^i'^k + Uk), 

• 1 comes before 2, which comes before 3,... which comes before the 
number v of distinct elements among the sequence, 

• ilT^ jl, ■■■,in7^ jn, 

• it has a correct structure. That is, the sequence of the 2{n — q) first 
terms has a correct structure as regards Tr and || || and, moreover, 

^n— 1 ■ ■ ■ ^n— g+l- 

We put Mv,i,(mk,nk),q ^^i the number of such sequences. Finally, the g-th 
moment of the degree distribution is 

„ rk 1 jTm. ~n. 

(^') = - + - E ^T^Kl^.-noX" (11) 

a a ^-^ m,!n,! '■^'^ 

v,{m, ,n,} 
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s=2 
e=3 



v=3 
e=2 




s=2 
v=3 
e=3 



s=2 

v=2 



Figure 2: Graphical representation for the coefficients of S3. 



In particular, the Molloy-Reed's criterion can, in principle, be computed 
by means of this formula : it involves normalized sequences (of type (m^, Uk)) 
to which are concatenated subsequences of 2 elements for (k) or 3 elements 
for (P). 

We now study a simple example in which all quantities mentioned above 
can be explicitly (although perturbatively) computed. 

7.3 Perturbation theory : the example tk = 0, = 2/i(5fc 3 

This is the simplest non-trivial case for which 7^ 0. The weight of a graph 
G is 

According to our previous discussion, we assume that /i < 0. To get the 
percolation criterion, we use the general theory exposed in the preceding 
section. We set Al^ ^ = ■^n,n-i,m- The definition of y* is y*u'{y*) = a, i.e. 

* I \ ^ *nH' 1 xt 

y + yny 

^-^ ml 

n,m 

In fact, we were not able to find an expression of A^^^ valid for all 
n,m, and we rely on a direct enumeration, up to order 6, of the normalized 
sequences, see table [T] for the first five orders. At this moment, a fully 
automated enumeration algorithm starting from scratch and working in a 
reasonnable time would need too much memory. To have some control over 
possible errors coming from human input, we have checked our results with 
two independent algorithms. On a 2.5 Ghz processor, the computation of the 
fifth order takes about 5 minutes, but the sixth order takes about 8 hours : 
the growth in complexity is extremely rapid, at least factorial. 
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Table 1: 


M'm for m 


= 5. 





Up to third order in takes the following form: 

y* = Q;-2Q;2(a + l)(2a + l)/x + 2a2(^-l-17Q;-56Q;^-57Q;^-15Q;^ 
4 

+4a^)/i2 - -a^{l + 81a + 788a2 + 2485a^ + 3303a^ + 1808a^ 
+159a^ - 126Qi^ + 8a^)//3 + • • • 

The percolation criterion states that there exists a giant connected com- 
ponent if y*u'(y*) + y*'^u"{y*) > 1 and that, on the contrary, all connected 
components are of finite size if y*u'{y*) + y*'^u"{y*) < 1. With u{y) = 
y + J2nm-^nm^y"'7 boundary between the percolating region and the 
non percolating region is a curve in the (q;,/x) plane, of equation : 



..III 

a-l + J2n{n-l)y*-f^Mi 







(12) 



We can solve this equation for o; as a perturbative series in /i. Up to order 
5, this yields 



2 9500 3 49718 4 991328 5 41436164 g 



CKperc = 1 ~ 26// + 336// — /i H — /t — 
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Table 2: Enumeration of the sequences appearing in {k) . 



Putting a = ttperc in formula ^ we find that, at the percolation threshold, 
the mean number of neighbours of a given vertex is 

. .^ 2 652 , 19786 . 3498268 , 67025012 . 
Cperc = 1 - 10/i - 50yU —ji — /i — — /i fl H 

(14) 

In the preceding section, we saw how to infer the moments of the degree 
distribution from enumeration of the appropriate normalized sequences. Ta- 
bles [2] and (H] show the result of these enumerations for (A;) and (/c^). 
We compute (k) (either by means of formula ^ or using the enumeration 
[2] together with eq. lfTTll ) and {k"^) as perturbative series in /i, and then solve 
the equation {k"^ — 2k) = in a to find 

2 7324 o 28708 , 577988 , 
aMR = 1 - 24/i + 274/^2 —^^ + — — + ■ ■ ■ , 

which does not coincide with aperc- 



8 Discussion and perspectives 

In this paper, we have studied a class of perturbations of the Erdos-Renyi 
model which introduce correlations between the edges : the weight of a graph 
depends on the abundance of certain geometric features. 
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Table 3: Enumeration of the sequences appearing in {k"^) ■ 



To solve this model, we have introduced an auxiliary model whose tree 
generating function u was expected to present better convergence properties 
than the original one it). The free energy F in the large N limit has been 
determined and a percolation transition has been established by means of an 
effective model : the percolation criterion is given by an equation, either on 
F or on u. We also have formulae for the degree distributions. 

On the basis of these general results we give explicit formulae for the above 
quantities in the particular case where all parameters but one vanish. These 
perturbative results raise some crucial questions. Indeed, we hope that the 
thermodynamical model makes sense for /i < but that /i > has to be 
discarded because it gives too much weight to strongly connected configura- 
tions and cannot be treated like a diluted, tree-like, regime. In fact, up to 
sixth order, it is not so clear that the series for a^erc is actually convergent 
for negative /x, because its general term increases very fast. However, as sug- 
gested by the fifth and sixth terms, we hope that the following terms may 
all be negative, the series hence being possibly summable when < 0. This 
interpretation is supported by the form of the perturbative expansion of 
the physical connectivity parameter Cperc, which seems much better behaved, 
with negative coefficients for orders >0. 
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We also have computed the Molloy-Reed criterion, which does not give 
an appropriate description of the percolation transition in this model. The n 
expansion of aMR seems to present the same pathology as aperc- A possibility 
is that this series is indeed divergent for negative /i : the equation {k"^ — 2k) 
may not admit any solution in a as soon as /i < 0. Another possibility is that, 
just as for aperc the series may stop to alternate at higher orders. Anyway, it 
would be desirable to determine a class of models for which the Molloy-Reed 
criterion is valid, and we believe that a minimal requirement may be a kind 
of locality. Indeed, the Molloy-Reed criterion concentrates on the first two 
moments of the degree distribution, which are local quantities in the sense 
that k^'^\G) can be computed as soon as the immediate environment of each 
vertex is known, independently of how the vertices are connected to each 
other. Even in the simple model that we used to illustrate perturbation the- 
ory, this information is not sufficient to compute the weight of a graph : one 
must also know the immediate environment of the first neighbours of each 
vertex. 

Finally, we also believe that a more thorough understanding of degree corre- 
lations induced by attacks deserves a systematic treatment. 
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